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Abstract 



We show that the subword complexity function p x (n), which counts the number of distinct 
factors of length n of a sequence x, is fc-synchronized in the sense of Carpi if x is fc-automatic. 
As an application, we generalize recent results of Goldstein. We give analogous results for the 
number of distinct factors of length n that are primitive words or powers. In contrast, we show 
that the function that counts the number of unbordered factors of length n is not necessarily 
f^) fc-synchronized for fc-automatic sequences. 
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1 Introduction 

We are concerned with the representation of integers in base fc, where fc > 2 is an integer. 
We let S/c = {0, 1, 2, . . . , k — 1}, and we let (n)k denote the canonical representation of n in 
base fc, starting with the most significant digit, and without leading zeroes. If x G we 
let [x] k denote the integer represented by x (where x is allowed to have leading zeroes) . To 



H-l 

^ represent a pair of integers (m, n), we use words over the alphabet EtxEj.. For such a word 

x, we let TTi(x) to be the projection onto the i'th coordinate. The canonical representation 
(to, n)k is defined to be the word x such that [7ri(x)]fc = to and [7T2(a;)]fc = n, and having no 
> leading [0,0] 's. For example (43, 17) 2 = [1, 0][0, 1][1, 0][0, 0][1, 0][1, 1]. 

Recently, Arturo Carpi and his co-authors HI [S] introduced a very interesting class of 
sequences that are computable by automata in a novel fashion: the class of fc-synchronized 
sequences. Let (/(n)) n >o be a sequence taking values in N. They call such a sequence 
\q k- synchronized if there is a deterministic finite automaton M accepting the base-fc repre- 

sentation of the graph of /, namely {(n, f(n)) k : n > 0}. 

Sequences that are fc-synchronized are "halfway between" the class of fc-automatic se- 
quences, introduced by Cobham 9 and studied in many papers; and the class of fc-regular 
sequences, introduced by Allouche and Shallit [HH]- They are particularly interesting for 
two reasons. If a sequence (f(n)) is fc-synchronized, then 

(a) we immediately get a bound on its growth rate: f{n) — 0(n)\ 

(b) we immediately get a linear-time algorithm for efficiently calculating /(n). 

Result (a) can be found in [SJ Prop. 2.5]. We now state and prove result (b). 

► Theorem 1. Suppose (/(?t))n>o is k- synchronized. Then there is an algorithm that, given 
the base-k representation of n, will compute the base-k representation of f(n) in O(logn) 
time. 

Proof. We know there is a DFA M = (Q, E fe x $, qo,F) accepting L = {(n, f(n)) k : n > 
0}. From result (a) above we know that f(n) < Cn, for some constant C, so if (n,f(n)) k 
is accepted, then the first component is s w for some s < log/,, C, where w is the canonical 
base-fc representation of n. Let N = s + \w\. We now create a directed graph out of N + 1 
copies of the transition graph for M, by starting at the final states of M and tracing a path 
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backwards, using the reversed transitions of M. This path is chosen so the first component 
of the labels encountered form w R s and the second component is arbitrary. The reslting 
graph has at most O(N) transitions and vertices. There will be only one path of length / 
with \w\ < I < N that leads to the initial state qo, and this can be found with depth- first 
search in O(N) time. Then, reading the corresponding labels of the second components in 
the forward direction gives the base-fc representation of f(n). -4 

In this paper, we are concerned with infinite words over a finite alphabet. Let x = 
aoaia.2 • • ■ be an infinite word. By x[m..n] we mean the factor a m a m +i ■ • • a n of x of length 
n — m+ 1. The subword complexity function p x (n) counts the number of distinct factors of 
length n. 

An infinite word or sequence x is said to be fc-automatic if there is an automaton with 
outputs associated with the states that, on input (ro)fc, reaches a state with output x[n]. In 
this paper we show that if x is a fc-automatic sequence, then the subword complexity p x (n) 
is fc-synchronized. As an application, we generalize and simplify recent results of Goldstein 
|13l 1 14] . Furthermore, we obtain analogous results for the number of length- n primitive 
words and the number of length-n powers. 

We remark that there are a number of quantities about fc-automatic sequences already 
known to be fc-synchronized. These include 

the separator sequence of a non-ultimately-periodic fc-automatic sequence [B] ; 

the repetitivity index of a fc-automatic sequence [I]; 
h the recurrence function of a fc-automatic sequence [5] ; 

the "appearance" function of a fc-automatic sequence [5]. 
The latter two examples were not explicitly stated to be fc-synchronized in [8], but the result 
follows immediately from the proofs in that paper. 

2 Subword complexity 

Cobham [S] proved that if x is a fc-automatic sequence, then p x (n) — 0(n). Cassaigne [7] 
proved that any infinite word x satisfying p x (n) — 0(n) also satisfies p x (n + 1) — p x (n) — 
0(1). Carpi and D'Alonzo [5] showed that the subword complexity function p x (n) is a 
fc-regular sequence. 

Charlicr, Rampersad, and Shallit [8 found this result independently, using a somewhat 
different approach. They used the following idea. Call an occurrence of the factor t — 
+ n — 1] "novel" if t does not appear as a factor of x[0..i + n — 2]. In other words, 
the leftmost occurrence of t in x is at position i. Then the number of factors of length n 
in x is equal to the number of novel occurrences of factors of length n. The property that 
x[j..i + n — 1] is novel can be expressed as a predicate, as follows: 

{{n,i) k : Vj,0 <j < i x[i..i + n- 1] ^x\j..j + n- 1]} = 

{(n, i)k ■ Vj, < j < i 3m, < m < n x[i + m] ^ x\j + rn]}. (1) 

As shown in [5], the base-fc representation of the integers satisfying any predicate of 
this form (expressible using quantifiers, integer addition and subtraction, indexing into a fc- 
automatic sequence x, logical operations, and comparisons) can be accepted by an explicitly- 
constructable deterministic finite automaton. From this, it follows that the sequence p x (n) is 
fc-regular, and hence can be computed explicitly in terms of the product of certain matrices 
and vectors depending on the base-fc expansion of n. 
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We show that, in fact, the subword complexity function p x (n) is /c-synchronized. The 
main observation needed is the following (Theorem[3]): in any sequence of linear complexity, 
the novel occurrences of factors are "clumped together" in a bounded number of contiguous 
blocks. This makes it easy to count them. 

More precisely, let x be an infinite word and for any n consider the set of novel occurrences 

E x (n) :— {i : the occurrence x[i..i + n — 1] is novel }. 

We consider how E x (n) evolves with increasing n. 
As an example, consider the Thue-Morse sequence 

t = t txt 2 ■■■ = 0110100110010110 

defined by letting t n be the number of l's in the binary expansion of n, taken modulo 2. 
The gray squares in the rows of of Figure [T] depict the members of Et(n) for the Thue-Morse 
sequence for 1 < n < 9. 



t[i] 

n = 1 
2 
3 
4 
5 
6 
7 



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 
0110100110010110100101100 



Figure 1 Evolution of novel occurrences of factors in the Thue-Morse sequence 



► Lemma 2. Let x be an infinite word. If the factor of length n beginning at position i is a 
novel occurrence, so is 

(a) the factor of length n+ 1 beginning at position i; 

(b) the factor of length n + 1 beginning at position i — 1 (for i > X). 

Proof, (a) Suppose the factor of length n + 1 also occurs at some position j < i. Then 
the factor of length n also occurs at position j, contradicting the fact that it was a novel 
occurrence at i. 

(b) Suppose the factor of length n + 1 beginning at position i — 1 occurs at some earlier 
position j < i — 1. We can write the factor as ax, where a is a single letter and x is a word, 
so the factor of length n beginning at position i must also occur at position j + 1 < i. But 
then it is not a novel occurrence. -4 

► Theorem 3. Let x be an infinite word. For n > I, the number of contiguous blocks in 
E^(n) is at most p x (n) — p^{n — 1) + 1. 

Proof. We prove the claim by induction on n. For n — 1 the claim says there are at most 
p x (l) contiguous blocks, which is evidently true, since there are at most p x (l) novel factors 
of length 1 . 
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Now assume the claim is true for all n' < n; we prove it for n. Consider the evolution of 
the novel occurrences of factors in going from length n — 1 to n. Every occurrence that was 
previously novel is still novel, and furthermore in every contiguous block except the first, we 
get novel occurrences at one position to the left of the beginning of the block. So if row n — 1 
has t contiguous blocks, then we get t — 1 novel occurrences at the beginning of each block, 
except the first. (Of course, the first block begins at position 0, since any factor beginning 
at position is novel, no matter what the length is.) The remaining p(n) — p(n — 1) — (t — 1) 
novel occurrences could be, in the worst case, in their own individual contiguous blocks. 
Thus row n has at most t + p(n) — pin — 1) — (t — 1) = p(n) + p(n — 1) + 1 contiguous 
blocks. -4 

In our Thue-Morse example, it is well-known that pt(n) — pt(n — 1) < 4, so the number 
of contiguous blocks in any row is at most 5. This is achieved, for example, for n = 6. 

► Example 4. We give an example of a recurrent infinite word over a finite alphabet where 
the number of contiguous blocks in E x (n) is unbounded. Consider the word 

w = Y[(n) 2 = 110111001011101111000- ■■ . 

n>l 

Then for each n > 5 the first occurrence of each of the words n 1, n_2 ll, . . . ,0 2 l n_2 
have a non-novel occurrence immediately following them, which shows there at at least n — 2 
blocks in E w (n). 

► Corollary 5. If p x (n) — 0(n), then there is a constant C such that every row E x (n) in 
the evolution of novel occurrences consists of at most C contiguous blocks. 

Proof. By the result of Cassaigne [7], we know that there exists a constant C such that 
Px(n) - p x (n - 1) < C - I. By Theorem|3] we know there are at most C contiguous blocks 
in any E x (n). -4 

► Theorem 6. let x be a k-automatic sequence. Then its subword complexity function p x (n) 
is k- synchronized. 

Proof. Following [H], it suffices to show how to accept the language 

{(n,m)k ■ n>0 and m = p x (n)} 

with a finite automaton. Here is a sketch of the argument. From our results above, we know 
that there is a finite constant C > 1 such that the number of contiguous blocks in any row of 
the factor evolution diagram is bounded by C . So we simply "guess" the endpoints of every 
block and then verify that each factor of length n starting at the positions inside blocks is 
a novel occurrence, while all other factors are not. Finally, we verify that m is the sum of 
the sizes of the blocks. 

To fill in the details, we observe above in ([TJ that the predicate "the factor of length 
n beginning at position i of x is a novel occurrence" is solvable by a finite automaton. 
Similarly, given endpoints a, b and n, the predicates "every factor of length n beginning at 
positions a through b is a novel occurrence" and "no factor of length n beginning at positions 
a through b is a novel occurrence" are also solvable by a finite automaton. The length of 
each block is just b — a + 1, and it is easy to create an automaton that will check if the sums 
of the lengths of the blocks equals m, which is supposed to be p x (n). -4 

Applying Theorem [l] we get 
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► Corollary 7. Given a k-automatic sequence x, there is an algorithm that, on input n in 
base k, will produce p x (n) in base k in time O(logn). 

As another application, we can recover and improve some recent results of Goldstein [131 
14 . He showed how to compute the quantities limsup n>1 p x {n) /n and liminf„>! p x (n)/n for 
the special case of fc-automatic sequences that are the fixed points of fc-uniform morphisms 
related to certain groups. Corollary [8] below generalizes these results to all fc-automatic 
sequences. 

► Corollary 8. There is an algorithm, that, given a k-automatic sequence x, will compute 
su P» t >i Px(n)/n, limsup n > x p x (n)/n, and inf„>i p x (n)/n, liminf„>i p x (n)/n. 

Proof. We already showed how to construct an automaton accepting {(n, p x (n))k ■ n > 1}. 
Now we just use the results from [T7J Notice that the limsup corresponds to what is 
called the largest "special point" in [15]. -4 



► Example 9. Continuing our example of the Thue-Morse sequence, Figure [2] displays a 
DFA accepting 

{(n,pt(ri))k : n > 0}. 

Inputs are given with the most significant digit first; the "dead" state and transitions leading 
to it are omitted. 

Given an infinite word x, we can also count the number of contiguous blocks in each E x (n) 
for n > 0. (For the Thue-Morse sequence this gives the sequence 1,1, 2, 1,3, 1,5, 3, 3,1,....) 
If x is fc-automatic, then this sequence is also, as the following theorem shows: 

► Theorem 10. If x is k-automatic then the sequence (e(n)) n >o counting the number of 
contiguous blocks in the n'th step E x (n) of the evolution of novel occurrences of factors in 
x is also k-automatic. 

Proof. Since we have already shown that the number of contiguous blocks is bounded by 
some constant C if x is fc-automatic, it suffices to show for each i < C we can create an 
automaton to accept the language {(n)k ■ E x (n) has exactly i contiguous blocks }. To do 
so, on input n in base fc we guess the endpoints of the i contiguous nonempty blocks, verify 
that the length-n occurrences at those positions are novel, and that all other occurrences 
are not novel. -4 
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Figure 2 Automaton computing the subword complexity of the Thue-Morse sequence 

► Example 11. Figure [3]below gives the automaton computing the number e(n) of contigu- 
ous blocks of novel occurrences of length- ro factors for the Thue-Morse sequence. Here is a 
brief table: 



n 





1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


e(n) 


1 


1 


2 


1 


3 


1 


5 


3 


3 


1 


5 


5 


5 


3 


3 



3 Implementation 

We wrote a program that, given an automaton generating a fc-automatic sequence x, will 
produce a deterministic finite automaton accepting the language {(ro, p x (n))k '■ n > 0}. We 
used the following variant which does not require advance knowledge of the bound on the 
first difference of p x (n): 

1. Construct an automaton R that accepts (n, s,e, £) if, for factors of length n, the next 
contiguous block of novel occurrences after position s ends at position e and has length 
I. If there are no blocks past s, accept (n, s, s, 0). 

2. Construct an automaton Mo that accepts (n, 0, 0). 

3. Construct an automaton Mj + \ that accepts (n, S, e) if there exist s and S' such that 

(i) Mj . accepts (n, 5", s) 

(ii) R accepts (n, s, e, S — S"). 
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4. If Mj + i = Mj then we are done. We create an automaton that accepts (n, S) if there 
exists e such that Mj accepts (n, S, e). 




Figure 3 Automaton computing number of contiguous blocks of novel occurrences of length-n 
factors in the Thue-Morse sequence 

Besides the automaton depicted in Figure [T] we ran our program on the paperfolding 
sequence [TT] and the so-called "period-doubling sequence" [TU]. The results are depicted 
below in Figures [4] and [5] 

4 Powers and primitive words 

Let wbea nonempty word. We say if is a power if there exists a word x and an integer k > 2 
such that w = x k ; otherwise we say w is primitive. Given a word z, there is a unique way 
to write it as y l , where y is primitive and i is an integer > 1; this y is called the primitive 
root of z. Thus, for example, the primitive root of murmur is mur. 

We say w = a% ■ ■ ■ a n has a period p if a, = a.; +p for 1 < i < n — p. Thus, for example, 
alfalfa has period 3. It is easy to see that a word w is a power if and only if it has a period 
p such that p < \w\ and p | 

Two finite words x, y are conjugates if one is a cyclic shift of the other; in other words, if 
there exist words u, v such that x = uv and y = vu. For example, enlist is a conjugate of 
listen. As is well-known, every conjugate of a power of a word a; is a power of a conjugate 
of x. The lexicographically least conjugate of a primitive word is called a Lyndon word. We 
call the lexicographically least conjugate of the primitive root of x the Lyndon root of x. 

The following lemma says that if we consider the starting positions of length-n powers in 
a word x, then there must be large gaps between contiguous blocks of such starting positions. 

► Lemma 12. Let z be a finite or infinite word, and let n > 2 be an integer. Suppose there 
exist integers i,j such that 

(a) wi :— z[i..i + n — 1] is a power; 

(b) ui2 '■= z[j..j + ?i — l] is a power; 

(c) i < j < i + n/3. 

Then z[t..t — n — 1] is a power for i <t < j. Furthermore, if x\ is the Lyndon root of w\, 
then X\ is also the Lyndon root of each word z[t..t — n — 1]. 
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Figure 4 Automaton computing the subword complexity of the paperfolding sequence 
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Figure 5 Automaton computing the subword complexity of the period-doubling sequence 
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Proof. Let x\ be the primitive root of w± and X2 be the primitive root of W2- Since X\ and 
X2 are powers, there exist integers pi,P2 > 2 such that w\ = x^ 1 and u>2 — x^ 2 - 

Since w\ and W2 are both of length n, and since their starting positions are related by 
i < j < i + n/3, it follows that the word v :— z[j..i + n — 1] is common to both w\ and u^, 
and | TJ | = i + n — j > i + 2n/3 + n/3 — j > 2n/3. 

Now there are three cases to consider: 

(a) \xi\ > \x 2 \; 

(b) |a:i| < |x 3 |; 

(c) \ Xl \ = \x 2 \. 

Case (a): We must have P2 > Pi > 2, so P2 > 3. Since v is a suffix of W\, it has period 
l^i I < n/2. Since v is a prefix of u>2, it has period \x2\ < n/3. Then \v\ > 2n/3 > |a;i | + |x2|. 
By a theorem of Fine and Wilf [T^], it now follows that v, and hence x\, has period p := 
gcd(|2;i|, \x2\) < \x2\ < \xi\. Now p is less than |xi| and also divides it, so this means x\ is 
a power, a contradiction, since we assumed x\ is primitive. So this case cannot occur. 

Case (b) gives a similar contradiction. 

Case (c): We have p\ = P2 ~> 2. Then the last occurrence of x\ in w\ lies inside x|, and 
so x\ is a conjugate of x%. Hence w± is a conjugate of W2- It now follows that z[t..t + n — 1] 
is a conjugate of w± for every t, i < t < j . But the conjugate of a power is itself a power, 
and we are done. -4 



► Remark. The bound of n/3 in the statement of Lemma 12 is best possible, as shown by 
the following class of examples. Let h be the morphism that maps 1—^21 and 2 — > 22, and 
consider the word 

/i i (122122121212). 

This word is of length 12 • 2% and contains squares of length 3 • 2 4+1 starting in the first 3 • 2 l 
positions, and cubes of length 3 • 2 4+1 ending in the last 2 l + 1 positions. This achieves a 
gap of n/3 + 1 infinitely often. 

Now, given an infinite word x, we define a function a x (n), the appearance function, to 
be the least index i such that every length-/; factor of x appears in the prefix x[0../ + n — 1]; 
see [3 §10.10]. 

► Theorem 13. //x is a k-automatic sequence, then a x (n) = O(n). 

Proof. First, we show that the appearance function is fc-synchronized. It suffices to show 
that there is an automaton accepting {(n,m)fc : m = a x (n)}. To see this, note that on 
input (n,m)fc we can check that 

for all z > there exists j, < j < m such that x.[i..i + n — 1] = + n — 1]; and 

_ for all I < m we have x[m..m + n — 1] =/= x[l..l + n — 1]. 
From [6, Prop. 2.5] we know ^-synchronized functions are O(n). 

As before, we consider maximal blocks of novel occurrences of length-n powers in x. Our 
goal is to prove 

► Lemma 14. Ifx is k-automatic, then there are only a constant number of such blocks. 

Proof. To begin with, we consider maximal blocks of length-n powers in x (not considering 
whether they are novel occurrences). From Theorem 13 we know that every length-n factor 
must occur at a position < Cn, for some constant C (depending on x). We first argue that 
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the number of maximal blocks of length-n powers, up to the position of the last length-n 
power to occur for the first time, is at most 3C. 

Suppose there > 3C + 1 such blocks. Then Lemma [12] says that any two such blocks 
must be separated by at least n/3 positions. So the first occurrence of the last factor to 
occur occurs at a position > (3C)(n/3) = Cn, a contradiction. 

So using a constant number of blocks, in which each position of each block starts a 
lcngth-7i factor that is a power, we cover the starting positions of all such factors. It now 
remains to process these blocks to remove occurrences of length-n powers that are not novel. 

The first thing we do is remove from each block the positions starting length-n factors 
that have already occurred in that block. This has the effect of truncating long blocks. The 
new blocks have the property that each factor occurring at the starting positions in the 
blocks never appeared before in that block. 

Above we already proved that inside each block, the powers that begin at each position 
are all powers of some conjugate of a fixed Lyndon word. Now we process the blocks 
associated with the same Lyndon root together, from the first (leftmost) to the last. At 
each step, we remove from the current block all the positions where length-n factors begin 
that have appeared in any previous block. When all blocks have been processed, we need 
to see that there are still at most a constant number of contiguous blocks remaining. 

Suppose the associated Lyndon root is y, with \y\ = d. Each position in a block is the 
starting position of a power of a conjugate of y, and hence corresponds to a right rotation 
of y by some integer i, < i < d. Thus each block Bj actually corresponds to some Ij that 
is a contiguous subblock of 0, 1, . . . , d — 1 (thought of as arranged in a circle). 

As we process the blocks associated with y from left to right we replace Ij with Ij := 
Ij — (Ix U • • • U Ij-i)- Now if I C {0, 1, . . . , d — 1} is a union of contiguous subblocks, let 
#1 be the number of contiguous subblocks making up /. We claim that 

#7{ + #7^ + ... + #/; + #( |J I' l )<2n. (2) 

l<j<n 

To see this, suppose that when we set I' n := I n — (I\ U ■ • • U I n -i), the subblock I n 
has an intersection with t of the lower-numbered subblocks. Forming the union (Ui<;< n AO 
then obliterates t subblocks and replaces them with 1. But I' n has t — 1 new subblocks, plus 
at most 2 at either edge (see Figure [6]) . This means that the left side of ^ increases by at 
most (1 — t) + (t — 1) + 2 = 2. Doing this n times gives the result. 

Ui<i<j-i li 

h I I I I 

i'i I I I I I I I I 1 I 

Figure 6 How the number of blocks changes 

Now at the end of the procedure there will be at least one interval in the union of all the 
Ii, so #/( + #I£ H h #74 < 2n - 1, and we have proved Q. 

Earlier we showed that there are at most 3C maximal blocks of length-n powers, up to 
the position of the last length-n power to occur for the first time. Then, after processing 
these blocks to remove positions corresponding to factors that occurred earlier, we will have 
at most 2(3(7) = 6C blocks remaining. M 
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► Corollary 15. //x is k-automatic, then 

the function counting the number of distinct length-n factors that are powers is k- 
synchronized; 

the function counting the number of distinct length-n factors that are primitive words is 
k- synchronized. 

Proof. Suppose x is fc-automatic, and generated by the DFAO M. From the Lyndon- 
Schiitzenberger theorem |15j . we know that a word x is a power if and only if there exist 
nonempty words y, z such that x — yz — zy. Thus, we can express the predicate P(i,j) := 
"x[i..j] is a power" as follows: "there does not exist d, < d < j — i + 1, such that 
x[i..j — d] = x[i + d..j] and x[j — d + = x[i..i + d — 1]". Furthermore, we can express 
the predicate P'(i, n) := "x[i..i + n — 1] is a length-n power and the first occurrence of that 
power in x", as 

P(i, i + n - 1) A (W, 0<i'<i, ->P(i',i' + n - 1)). 

From Lemma [14| we know that the novel occurrences of length-n powers are clustered into 
a finite number of blocks. Then, as in the proof of Theorem [6] we can guess the endpoints 
of these blocks, and verify that the length-n factors beginning at the positions inside the 
blocks are novel occurrences of powers, while those outside are not, and sum the lengths of 
the blocks, using a finite automaton built from M. Thus, the function counting the number 
of length-n powers in x is fc-synchronizcd. 

The number of length-n primitive words in x is then also fc-synchronized, since it is 
expressible as the total number of words of length n, minus the number of length-n powers. 

► Remark. Using the technique above, we can prove analogous results for the functions 
counting the number of length-n words that are a-powers, for any fixed rational number 
a > 1. 

5 Unsynchronized sequences 

It is natural to wonder whether other aspects of fc-automatic sequences are always fc- 
synchronized. We give an example that is not. 

We say a word w is bordered if it has a nonempty prefix, other than w itself, that is also 
a suffix. Alternatively, w is bordered if it can be written in the form w = tvt, where t is 
nonempty. Otherwise a word is unbordered. 

Charlier et al. [8] showed that u x (n), the number of unbordered factors of length n of 
a sequence x, is fc-regular if x is fc-automatic. They also gave a conjecture for recursion 
relations defining ut(n) where t is the Thue-Morse sequence; this conjecture has recently 
been verified by Goc and Shallit. 

We give here an example of a fc-automatic sequence where the number of unbordered 
factors of length n is not fc-synchronized. 

Consider the characteristic sequence of the powers of 2: 

c:= 011010001000000010- •• . 

► Theorem 16. The sequence c is 2-automatic, but the function u c (n) counting the number 
of unbordered factors is not 2 -synchronized. 



Subword Complexity 



Proof. It is not hard to verify that c is 2-automatic and that c has exactly r + 2 unbordered 
factors of length 2 r +l, for r > 2 — namely the factors beginning at positions 2* for < i < r, 
and the factor beginning at position 2 r + 1. However, if u c (n) were 2-synchronized, then 
reading an input where the first component looks like CP10 r l (and hence a representation 
of 2 r_1 + 1) for large r would force the transitions to enter a cycle. If the corresponding 
transitions for the second component contained a nonzero entry this would force u c (n) to 
grow linearly with n when n is of the form 2 r + 1 . Otherwise, the corresponding transitions 
for the second component are just O's, in which case u c {n) is bounded above by a constant, 
for n of the form 2 r + 1. Both cases lead to a contradiction. M 
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